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ABSTRACT 


This  paper  is  concerned  with  the  problem  of  diagonally  scaling  a given 
nonnegative  matrix  a to  one  with  prescribed  row  and  column  sums.  The 
approach  is  to  represent  one  of  the  two  scaling  matrices  ais  the  solution 
of  a variational  problem.  This  leads  in  a naturad  way  to  necessary  and  suf- 
ficient conditions  on  the  zero  pattern  of  a so  that  such  a scaling  exists. 
In  addition  the  convergence  of  the  successive  prescribed  row  and  column  sum 
normadizations  is  established.  Certain  invariants  under  diagonal  scaling 
are  used  to  actually  compute  the  desired  scaled  matrix,  and  examples  are 
provided.  Finally,  at  the  end  of  the  paper,  a discussion  of  infinite  systems 
is  presented. 
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SIGNIFICANCE  AND  EXPLANATION 


In  order  to  avoid  heavy  round-off  error  when  solving  large  systems  of 
linecu:  equations  on  the  computer,  it  is  useful  to  scale  the  coefficient  matrix. 
This  scaling  amounts  to  multiplying  the  matrix  by  diagonal  matrices  on  the  left 
and  right.  We  consider  nonnegative  coefficients  and  concern  ourselves  with  the 
problem  of  choosing  appropriate  diagonal  matrices  in  order  that  the  scaled 
matrix  have  certain  prescribed  row  and  coluim  sums.  In  addition,  if  some  of 
the  coefficients  are  zero,  we  eisk  if  this  can  at  all  be  done. 

Our  approach  is  to  replace  the  scaling  problem  by  an  equivalent  problem 
of  finding  the  minimum  of  a certain  function.  This  facilitates  much  of  the 
analysis  and  leads  in  a natural  way  to  necessary  and  sufficient  conditions  on 
the  pattern  of  zeros  in  the  coefficient  matrix.  Included  is  a technique  for 
actually  computing  the  appropriate  scaling  matrices,  a number  of  examples 
illustrating  the  results,  eind  a discussion  of  infinite  systems. 
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Th©  responsibility  for  the  wording  and  views  expressed  in  this  descriptive 
summary  lies  with  MRC,  and  not  with  the  authors  of  this  report. 


A VARIATIONAL  EQUIVALENT  TO  DIAGONAL  SCALING 


Marc  A.  Berger  and  C.  T.  Kelley 


Introduction 


In  this  paper  we  present  a new  approach  to  the  problem  of  diagonally  scaling  a non- 
negative Dxn  matrix  a to  one  with  prescribed  row  euid  coluan  sums  a and  6 ; and 
to  the  question  of  convergence  of  the  successive  a cuid  S row  and  coluan  sum  normali- 
zations of  a . Our  methods  show  this  equivalent  to  the  problem  of  minimizing 

n m 5 . 

n ( I a t 
j=l  i=l  J 

m n 

over  (positive  vectors  in  K ) subject  to 

m 0. 

n t ^ = 1 . 
i=l  ^ 

And  this  leads  us  in  a natural  way  to  conditions  on  the  pattern  of  zero  elements  in  a 
(Theorem  2.D).  The  main  computational  result  underlying  the  theoretical  parts  of  the 
paper  states  that  the  solution  to  the  above  minimum  problem  is  given  by  (3.12). 

In  §1  we  present  the  problem  and  the  tools  employed  to  solve  it.  In  §2  the  existence, 
non-existence,  uniqueness  amd  convergence  results  are  presented  for  finite  systems.  In 
§3  certain  invarlamts  under  diagonal  scaling  are  described,  and  used  to  actually  compute 
the  desired  scaled  matrix.  And  some  examples  are  provided.  Finally,  in  the  last  section 
a discussion  of  infinite  systems,  using  functional  analysis,  is  presented. 

The  results  in  §2  concerning  the  scaling  and  convergence  questions  are  not  new.  The 
original  work  involved  doubly  stochastic  matrices,  and  appears  in  Sinkhom  (7),  Sinkhom 


and  Knopp  [8],  Menon  [5],  and  Brualdi,  Farter  emd  Schneider  [31.  The  generalization  to 
non-square  matrices  and  arbitrary  positive  a amd  6 appears  in  Bacharach  [1],  Brualdi  [2], 
amd  Menon  amd  Schneider  [6] . 


The  approach  of  representing  the  solution  in  terms  of  a variational  problem  is  used 
by  Theil  [9]  and  Gorman  [4].  The  former  minimizes 

‘’ij  V 
ij 

over  positive  matrices  b having  o and  6 as  row  and  coluim  sums.  And  the  latter 
minimizes 

^onsored  by  the  United  States  Army  under  Contract  No.  DAAG29-75-C-0024  amd  the  National 
Science  Foundation  under  Grant  No.  MCS75-17385  AOl. 


I I “ij  e ^ - I o.t  - I 6 s 

i-1  j-1  •’  i=l  j“l 


over  . The  problem  we  formulate,  however,  is  tied  directly  to  the  Menon  operator 

in  [5] , and  leads  in  a most  natural  and  direct  way  to  the  full  admissibility  condition  of 
Theorem  2.D, 


Let  o tuid  6 be  positive  vectors  in  k”*  cuid  e",  respectively,  satisfying 
m n 

(1.1)  i o = I 6 . 

i=l  j=l  J 

And  let  Q denote  the  set  of  nonnegative  matrices  a in  R*"  ® f"  with  no  vanishing 
rows  or  columns.  Define  normalizing  operators  R and  C from  Cl  into  itself  by 

(1.2)  (Ra)..  = a..(J^  a.^)-^.  , (Ca)..  = a.  . ( 


The  reader  can  verify  that 

(1.4)  SflA)  = 3(C)  n 3(R)  5 3(C,R) 

over  the  set  e"'"  of  positive  matrices  in  e”  ® e”,  where  3 denotes  the  set  of  fixed 
points.  Of  course,  over  67  , 3(A)  is  strictly  larger  than  3(C,R).  We  concern  ourselves 
with  the  following  two  problems. 

For  which  a e 67  does  the  sequence 

(P  ) 

^ {A^a}  converge  to  a limit  in  3(C,R),  2md  to  what  limit? 

For  which  a e 67  there  exist  positive  diagonal  matrices  x and  y 

(P.) 

such  that  xay  c 7(C,R),  and  are  they  unique  v?)  to  scalar  multiples? 

Let  E^  denote  the  set  of  positive  vectors  t in  E™.  For  a fixed  a t c7  we 
associate  an  operator  X 5 X(a)  from  E^  into  itself  by 


n ra  1-1 

*ij*j ‘J,  Vk’  ’ 

1-1  ■’  k-1 


This  operator  was  defined  by  Menon  in  [5),  and  was  used  by  Brualdi,  Farter  and  Schneider 
in  [3)  to  analyze  (P^)  when  3(C,R)  is  the  assignment  polytope  £1^  . It  was  also  used 
by  Menon  and  Schneider  in  (6)  to  analyze  (P  ) in  the  more  general  setting  put  forth  here. 
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Vte  note  that  if  it-t  then  the  matrix  b defined  by 


‘ “ij‘ 


is  in  ?(C,R).  And  conversely,  if  b of  the  form  b^^^  “ *ij^i®j  ^ 3(C,R),  then 

lt»t.  Furthermore, 

(1.7)  (A  a)^^  - >i^j 


t°  = ( I a )-^a^  . 
j-1 


And  by  inverting  this  it  follows  that 

(1.9)  (A°).  - n ( I (A^'a) 

^ ^ k-0  J-1 

Thus  A allows  one  to  consider  both  (Pj^)  and  (P^). 

We  show  in  the  next  section  that  solving  At«t  corresponds  to  minimizing 
n m A.  m -a. 

11(1  a,.t.)^  n t.^.  This  leads  us  in  a natural  way  to  conditions  cwi  o and  6 , 

j-1  1=1  ■'  1=1 

relative  to  the  pattern  of  zeros  in  a , such  that  (Pj^)  and  (P^)  may  be  solved.  In  the 
third  section  we  construct  invariants  of  a under  C and  R which  allow  one  to  compute 
the  limit  of  {A**a}.  Finally,  in  the  last  section  we  discuss  the  case  where  one  or  both 
of  ra  and  n are  infinite,  and  solve  analogs  of  (Pj^)  and  (P^)  in  this  setting. 


Consider  the  function  ♦ defined  on  by 

n m S.  m -a. 


‘ I “ij"i’  ''  "i  ' 


j-1  i-1  *■’  * i-1 

The  condition  V log  it(t)  =0  is  equivalent  to  At=t.  Thus  if  $ has  a critical  point 
in  K™  then,  in  accord  with  the  remarks  in  the  previous  section,  there  exist  positive 
diagonal  matrices  x and  y such  that  xay  c 3(C,R).  We  consider  the  problem  of  mini- 
mizing ♦ . It  follows  from  (1.1)  that  ♦ is  homogeneous  in  t , and  thus  it  suffices  to 
iB^se  the  constraint  t t B , where 


{tern™:  I t,  ■ 1 > 

i-1  ^ 
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If  a e then  t tends  to  infinity  as  t approaches  the  boundary  3B  from  inside 

of  B . And  since  ♦ is  continuous  in  B , it  follows  that  a minimum  exists  in  B . 
However)  if  sooie  of  the  elements  of  a vanish*  then  thxs  need  not  be  the  case.  In  fact*  if 
F is  a proper  subset  of  » » a,...,m}  and  F*  is  its  con?>lement,  we  can  examine  the 


behavior  of 

♦(t  + €T  ) as 

e 

e tends  to  zero;  where  t. 

1 

is  zero  for  i £ F 

for  i £ F'  , 

euid  T £ R . 
e + 

with  T . - 1 for  i £ F . 

e.i 

The 

condition  that 

infinite  is 

(2.3) 

I «• 

j£C(F)  ^ 

< J 0. 

i£F  ^ 

where 

(2.4) 

C(F)  = { 

i : a,  . * 0 for  each  k £ F' 
kj 

} . 

Furthermore , 

note  that  C (F) 

is  empty  unless  F contains 

one 

of  the  sets 

(2.5) 

II 

H* 

o 

A 

Theorem  2. A: 

Let  a £ ^ such  that  (2.3)  holds  for  all  proper  subsets  F which  are  unions  of 

the  sets  F^  in  (2.5).  Then  there  exist  positive  diagonal  matrices  x and  y 
such  that  xay  e Jf(C,R)  , and  they  are  \mique  up  to  sccd.ar  multiples. 


Proof: 

* 

First  we  estedilish  (2.3)  for  any  proper  subset  F . Let  F be  the  union  of  the  F 

contained  in  F . Then  F*  c F and  C(F*)  = C(F).  Therefore  (2.3)  must  hold  for  F , 

* 

since  it  holds  for  F 

Let  t*  £ 3b.  Then  lim  ♦(t  + et  ) = “ whenever  t + £ r”  . Thus  it  follows 

e+0 

that  lim^  ♦(t)  * “.  This  meems  that  ♦ has  a minimum  in  B , and  thus  that  x and  y 
t •♦t 
exist. 

To  establish  uniqueness  it  suffices  to  show  that  if  a and  xay  are  in  5(C,R) 

'u  % '\/  Orf 

where  x euid  y are  positive  diagonal  matrices  then  a = x a y . 


(2.6) 


— O/  T/'v  n'w  'w'w 

k ■ Vj’  - ' Vj’  ■ ° • 


i-1 


-4- 


H 

i ■ 

■f  : 


'Vi  'V  . 1 ^ 

Scale  X and  y so  that  each  ^ 1 ^nd  each  y.  ^ 1,  and  at  least  one  element  is 

'V.  'V;  ‘Vi 

equal  to  one.  Say  x.  » 1.  Then  it  follows  from  (2.6)  that  y.  * 1 whenever  a . > 0. 

% 'V*  'Vi  'V» 

Thus  a,  . = a.  . x y.  for  every  j , and  moreover,  some  y.  » 1.  Then  similarly 

a,  . » a.  . x.y.  for  every  i . That  is,  the  i_-th  row  iuid  j„-th  column  of  a - xa 
ij„  i j„  '0  -’O 

0 0 0 

vanish.  So  we  delete  them  iuid  consider  the  (m-1)  x (n-1)  case.  Continuing  this  reduc- 

n.  % 

tion,  eventually  either  one  or  both  of  x and  y reduce  to  a single  element.  And  this 

case  is  trivial.  ■ 

Theorem  2.B: 

Under  the  conditions  of  Theorem  2.  A,  the  sequence  {A*^a}  converges  to  x a y . 

Proof: 

It  follows  from  (1.7)  euid  the  uniqueness  result  that  the  convergence  of  {X^t^}  to 

N 

an  interior  fixed  point  t of  X , implies  the  convergence  of  {A  a}  to  xay.  Suppose 
first  that  there  exists  p such  that  (aa)^  e F™'"  where  a denotes  the  transpose 
of  a . Then  for  any  distinct  vectors  u eind  v in  JR™  with  u v , we  not  only  have 

p p 

Xu  ^ Xv  , but  (X  u)^  < (X  v)^  for  every  i . Now  let  t be  an  interior  fixed  point  of 

X , and  let  [0^,6^]  be  the  smallest  interval  such  that  a^t  ^ t°  ^ B^t  . Then 

Opt  ^ Xt°  ^ Bpt  and  we  can  reduce  the  interval  [u^.B^l  to  the  smallest  such 

that  Oj^t  ^ Xt^  ^ ®l''‘  Continuing  in  this  manner  we  generate  a sequence  of  nested  inter- 
vals such  that 

(2.7) 


1 < B^t 


and  [cijj,  Bj^l  is  the  smallest  such  interved..  This  nested  sequence  converges  to  [o,Bl  , 

and  it  suffices  now  to  show  that  a ==  B. 

* \ 0 

So  suppose  a ji  & and  let  t = lim  X t be  any  cluster  point  of  the  sequence 

k -►  <» 

NO  * ^ 

{X  t }.  Then  at  ^ t ^ Bt  and  [a,B]  is  the  smallest  such  interval.  But  then 
0 

t = lim  X t is  cuiother  cluster  point,  and  at.  < t,  < Bt,  for  every  i ; and  this 

k-x»  1X1 

contradicts  the  minimality  of  [a,Bl.  Thus  a = B. 

Sippose  next  that  no  powers  of  aa  are  positive.  Then,  since  aa  has  a positive 
diagonal,  it  cem  be  written  as  a direct  sum  of  bloc)cs  b over  subspaces 

q 

IR™  = {t  e r"  : t.  = 0 , i € F } 
q i q 

-5- 


where  F c s . amd  each  b has  a positive  power.  And  X can  accordingly  be  deconposed 
q m q 

into  operators  X acting  on  IR*  , the  set  of  positive  vectors  in  K . Thus  it  suffices 
q q»+  H 

to  establish  the  convergence  of  { t°''*},  where  is  the  corresponding  restriction  of 

q 

t^  to  JR™  . Furthennore,  this  will  follow  iamediately  from  the  first  part  of  this  proof, 

q 

as  long  as  each  X has  fixed  points  in  R**  . And  this  is  inaediate  from  HieoreB  2. A, 

q qr* 

which  establishes  fixed  points  for  X in  R**  .■ 

Theorems  2. A and  2.B  solve  (P^^)  and  (P^)  under  conditions  (2.3).  In  particular  we 

note  that  under  these  conditions  there  always  exists  a matrix  b e 31C,R)  such  that 

a.  = 0 if  and  only  if  b.  . = 0.  What  we  show  below  is  that  if  there  exists  any  F such 
13  13 

that 

(2.8)  I > I Oi 

j£C(F)  ieF 

then  no  such  b exists,  even  if  we  allow  b to  have  extra  zeros.  And  therefore  for  these 
a £ jio  diagonal  matrices  x and  y exist  for  (Pj).  And  (A  a)  does  not  converge  to 
a limit  in  ?(C,R). 


Theorem  2.C: 

Suppose  a e C7  ^ such  that  (2.8)  holds  for  soto  F . Then  for  no  b £ <7  n 3(C,R) 

is  it  true  that  a. . = 0 implies  b. . =0  . 

1]  — ^ 13 

Proof: 

Suppose  such  a b exists.  Then  (2.8)  holds  for  b even  if  b has  more  zeros  than 
a , because  this  will  only  serve  to  enlarge  C(F).  But  then 

I a,  = I Ib  > I I b = I I ‘>13=  I I I «3 

i£P  ieFj=l  ^ i£Fj£C(F)  ^ j£C(F)  i£F  j£C(F)i=l  j£C(F)  ■’ 

which  is  a contradiction.  ■ 

Finally  we  wish  to  consider  the  case  where 


(2.9) 


I s = I 

j£C(F)  •'  ifF 


for  some  proper  subset(s)  F , (2.3)  holding  for  the  others.  In  this  case  in  order  that 
X and  y solving  (P^)  exist,  it  is  necessary  that 


(2.10) 


a^^  = 0,  i€F,jeC(F)’ 


for  those  F satisfying  (2.9).  This  is  sin?>ly  because  it  is  necessary  that  = 0 


-6- 


i £ F , j € C(F)’.  The  result  below  shows  that  these  conditions  are  sufficient  as  well. 
In  order  to  avoid  cumbersome  hypotheses  we  summarize  these  conditions  in  a definition. 
Thus  a £ 47  will  be  called  fully  admissible  if  (2.3)  or  (2.9)  hold  for  every  proper  sub- 
set F ; and  (2.10)  holds  for  those  F satisfying  (2.9). 


Theorem  2.D: 

a £ 47  ^ fully  admissible  if  and  only  if  there  exist  diagonal  matrices  x and  y 

solving  (P^).  And  under  these  conditions  {A  a}  converges  ^ xay  . 

Proof: 

Sufficiency  has  ed.ready  been  established  in  the  discussion  above.  To  prove  necessity 
we  can  assume  (2.9)  holds  for  some  proper  subset  F . Otherwise  the  existence  of  x and 
y follows  directly  from  Theorem  2. A.  Because  of  (2.10)  we  can  reduce  the  problem  to  the 
two  sub-matrices  ^ ^ 3 ^ C(F))  and  (a^^  : i £ F' , j £ C(F)').  Both  of  these 

sub-matrices  are  fully  admissible  euid  their  dimensions  are  strictly  less  than  those  of  a 
Thus  an  induction  establishes  the  existence  of  diagonal  matrices  x'  and  y'  , x"  and  y 
solving  (P^)  for  these  smaller  matrices.  And  x'  ,x"  and  y'  ,y"  Ccin  be  pieced  together 
to  construct  diagcxial  matrices  x and  y solving  (P^)  for  a . 

The  vector  (x^)  is  a fixed  point  of  A , and  its  existence  is  enough  to  justify  the 
proof  of  Theorem  2.B.  ■ 


If  a £ 47  is  not  fully  admissible,  but  is  such  that  (2.3)  or  (2.9)  hold  for  every 
proper  subset  F , then  a is  said  to  be  rartially  admissible.  For  such  a matrix  a , 
there  will  not  exist  b e 47  n3(C,R)  hav  ing  the  same  zero  pattern  as  a . (But  if  we 
allow  b to  have  extra  zeros,  then  such  matrices  b will  exist.)  Thus  Theorem  2.D 
amounts  to  saying  that  whenever  there  exists  a matrix  in  47  n 3(C,R)  having  the  same  zero 
pattern  as  a , then  x and  y exist,  and  conversely.  Hence  t defined  by  (2.1)  has 
critical  points  in  if  and  only  if  a is  fully  admissible.  Finally  we  note  that  for 

the  doubly  stochastic  case,  where  m=n  and  a = 6 = 1 for  each  i and  j , full  admis 

^ j 

sibility  reduces  to  a cardinality  condition. 


(3.1) 


Osserve  that  if  it  is  a k-cycle  then 


k 

n 


-1 


p«i  ^p  ’’jp 

is  invariant  under  C and  R , provided  it  is  well-defined.  In  fact  it  is  invariant 

N 

inder  euiy  diagonal  scaling  xay.  These  ratios  can  be  used  to  compute  limA  a when 
it  exists,  as  illustrated  by  the  following  exanple. 

Example  3. A: 


Let 


f ° 
V. 


12 

0 

^32 


and  a.  = 6 . = 1 for  each  i and  j . Then 
t 3 

^ " ®12®23®31®13®32®21 

is  invariant.  Thus 


lim  A a = 

N 00 

1^  1 
3f,  . 3vl 


where  p = r^(l+r  ) . And  a diagonal  scaling  that  leads  to  this  is 


( . ®13  p *12  1-PI 

X = diag[  1 . . I 

23  32 

y = diag(  A ^ 

*31*12  ' *12'  *13 

Finally,  for  general  0 and  6 conditions  (2.3)  become 

^1  ^ ‘’2  °3  , '*2  ‘ + '^3  , *3  ^ °1  ^ °2- 

From  now  an  we  assume  that  a has  a positive  row  and  column.  Without  loss  of  gener- 
ality these  may  be  the  first  row  amd  column.  Then  we  cam  define  a ratio  matrix  r by 

,,  -1  -1 
(3.2)  r.  . = a.  .a,,a.,a  , . 

1]  13  11  il  13 


This  matrix  is  invariant  under  C amd  R , and  can  be  used  to  define  analogues  of  X 


and  ♦.  By  writing  the  diagonal  matrix  x of  (P^)  in  the  form  x^  = t^  a^j^  we  are  led 
to  consider  the  operator  A defined  by 


(At)^  = [ 


il  iU 

I r. .6.  ( I r t 
j=l  ^ k=l 


) 1 a.. 


If  At=t  then  the  matrix  b defined  by 


b.  . = r.  . ( 
13  13  , 


*"  -1 
I r t ) t.6.  = a.  . (Y  a^.t  a )t.a  6. 

k]  k 13  1]  Kj  k kl  1 il  3 


is  in  ^(C.R);  and,  conversely,  if  b of  the  form  b^^  = j i®  i"  S(C,R)  then 

At=t.  And  the  corresponding  relationship  between  iterates  of  A and  A is 
(3.5)  (A\,^.=r^.a 


^0  0 

(3.6)  t.  = a.^t.  . 

And  by  inverting  this  it  follows  that 

(3.7)  (a\°).  = a n [ I (A’"a)..]"^  . 

^ ^ k=0  j=l 

Similarly  the  corresponding  function  to  minimize  is 

A.  n m . m . 

(3.8)  ■Kt)  = n ( y r.  .t. ) ^ n t.  ^ . 

j=l  i=l  ^ i=l  ^ 

Prcblem  (P^)  amounts  to  seeking  a matrix  b with  the  same  ratio  matrix  as  a , having 

prescribed  row  and  column  sums.  And  the  uniqueness  result  in  Theorem  2. A amounts  to 

saying  that  distinct  matrices  with  identical  row  and  column  sums  have  distinct  ratio 

matrices.  Finally,  we  note  that  because  of  (1.1)  both  i and  $ are  homogeneous  in  t, 

and  thus  it  suffices  to  minimize 

n m 6. 

(3.9)  t(t)  = n ( y a,  .t.)  ^ 

. , • , ID  1 

3=1  3=1 


» n m 6. 

'l'(t)  = n ( I r.  .t.)  ^ 
j=l  i=l  ^ 


over  , subject  to 


m a. 
n t/  = 1 
i=i  ^ 


The  solution  to  the  former  is 


t*  = c n o.  I y (A ’'a)  . 

1 k=0  ^ j=l 


and  the  solution  to  the  latter  is 


t . = c a. ,t. 

1 il  1 


where  c emd  c are  appropriate  scale  factors.  The  remainder  of  this  section  is  devoted 
to  examples. 

Example  3.B: 

For  the  case  m=n=2  the  limit  is 


where  p is  a nonnegative  solution  of 

2 '^22°!  ^"2.  ’^22^1  „ 

^ - <«1  + r -1~  r ° ° 

22  22 

provided  a is  nonsingular.  Otherwise  u = ^22”*^  condition  (2.3) 

becomes  '^2  ^ *^1  ' ^2  ~ °1  p = 0.  The  corresponding  variational  problem  is  to 

^1  *2  -°1 
minimize  (s+l)  ^®‘*'^22^  ^ 


Exanple  3.C: 


For  the  case  m=2 , n=3  the  limit  is 


/ P n o^-p-u  1 

I *1”^  *2”^  P+ht6^-Oj^  j 

where  y,n  are  nonnegative  solutions  of 

, '2  ^22 \ 

2 . . “^32 VS.  *^32^  , S2SS  , 

p +pii-(a  + — - — ^)p  - - — ::Yn  + — — — 0 . 

^ 32  ^32  32 

These  reduce  to  a single  cubic  in  p . If  r^j  = 1 then  the  first  equation  becomes 

n = 6 6 ^p  . If  r = 1 then  the  second  equation  reduces  to  (6  + 6 )p  + 6 n = o,  5.  . 

2 1 32  J.  J J X i. 

And  if  ^ ^23  ” ^ ^ ^ °lS^°l  * '’2'  ^ ^ °lS^V°2*  ^ corres- 

ponding variational  problem  is  to 


S S '^3  ■°i 

minimize  (s+l)  ^^''■'^22^  ^^^^23^  ® 


Here,  as  before 


Example  3.D: 

For  the  case  m=n=3,  0^“  ^ ^ ’'^23“^32“’^22^33  °’^ 

r ^=r,  =r„,r  „ then  the  limit  is  a permuted  matrix  (i.e.  all  rows  and  columns  are 

22  33  23  32 

permutations  of  one  another).  In  the  former  case  it  is  symmetric;  in  the  latter,  s)cew- 
symmetric. 


Example  3.E: 


The  conditions  (2.3)  iDCCome 

°2  ""  *l'^*4  ' °4  ^ '^l‘^'^2  ' S * ®l'^°3  ■ 

Thus  we  Ccin  consider  the  case  = 6^  = 1 for  each  i and  j . The  corresponding 
variational  problem  is  to 

minimize 


subject  to 


By  symmetry  t =t  and  t =t  . Thus  we  need  to  minimize 

2 3 2 

Letting  s = t^  the  condition  for  a minimum  is  8s  + 8s  = 1.  The  only  positive  solu- 
tion is  i(/F-l).  Thus 

4 

tj^  = t j = i-  ■ *"2  “ '"4  “ ■ 

This  means  that  diagonal  scaling  matrices  are 


X = diag(tj^,t2,t^,t^) 


= i diag{/5/5-ll , t , /2/5-i  , /2/5-4  ) 


-11- 


and  the  limit  x a y is 


S4.  Scaling  Infinite  Matrices 

In  this  section  we  consider  the  sceiling  of  infinite  matrices;  i.e.  one  or  both  of  m 

and  n is  infinite.  We  assume  n * and  that  o € H and  6 € # are  positive 

vectors  satisfying  (1.1).  Denote  by  f”  the  B^mach  space  of  infinite  mxoo-matrices  a 

having  real  entries  such  that 
“ m 

(4.1) 


I I 

j=l i-1 


|P  = 


Note  that  f"  is  a Hilbert  space  for  emy  m (even  m=<»)  . Let  a £ )»  positive,  and 


consider  the  following  analogue  of  (P^) : 

Find  t £ k"  and  s e such  that  the  matrix  b given  by  b^ ^ « a^^t^^s^ 


satisfies 


(P^)- 


I b.  . ” Oj 
ID  i 


l,...,m  ; 51  b^^-  , j - 1,2, 


j-1 


i-1 


Itie  matrix  b e is  sedd  to  be  a solution  of  (Pj)'  - 


Theorem  4. A 


(Pj)'  has  at  most  one  solution. 


Proof: 


^ '\j  ^ ^ 

As  in  the  proof  of  Theorem  2. A,  we  assume  that  a and  solutions  of 


(4.2) 


m 

^ = J,  “i)c\ 
)l”l 


m,m 


vriiere  a £ H'"'"'  is  given  by 


-12- 


■3 


f.  ' 


i • 


(4.3) 


-I'V-  n "X,  -X/  ^-i'X, 

“ik  = °i  4 • 


'X-  "X/  ,-1'X, 


We  note  that  a satisfies 
m 


(4.4)  I a..^  = 1 , i = l,...,m  ; I " °y  ' ^ • 

k=l  i=l 

By  Jensen's  inequality  ^ 

-t . m 

(4.5)  Yi  - ® ^ 1 I “hv^v  '■  i = l,...,in  . 

1 k=l 

If  m = “>,  note  that  the  sum  on  the  left  of  (4.5)  converges, as  f.  1-  We  have 

\ ' 


m 

(4.6) 

I 

i=l 

and  so 

(4.7) 

k=l  i=l 


^i  = “ik\ 
k=l 


k=l 


i = 1 , . . . ,m 


But  of  course  the  same  argument  can  be  used  to  show  that 

m 

(4.8)  y"  = I oi.^y"  : i = l,...,m 

1 ,^=1  Ik  k 

for  any  integer  n ^ 0.  And  by  taking  limits  of  sums  it  follows  that 

(4.9) 


, x2  , i2 

-(Y.-Y^)  m -(Y^-Y^) 

e = I a .^e 


k=l 


ik 


"X,  "X, 

If  t^  ^ tj^  for  some  i , then  for  all  i = l,...,m 

ra  -(Yj^-Y,)^ 

(4.10)  I a e < 1 . 

k=l 

Since,  by  (4.9),  the  inequality  (4.10)  cannot  hold  for  i=l,  it  must  indeed  be  true  that 
■x, 

t is  a constcuit  vector.  ■ 

To  consider  the  question  of  existence , suppose  first  that  m < ».  For  N ^ 1 let 

m /N  N m N 

p..  = y a.  / } 6.  . According  to  Theorem  2. A there  is  a unique  matrix  a e TR  ' 

^ i=i  y j=i  3 ^ ^ + 

N N N 

such  that  a.  . = a.  . t.  s.  and 
13  13  1 3 


nieoreni  4.B: 


For  the  case  m < »,  the  sequence  {b  } defined  by  (4.12)  converges  in  the  norm 


topology  of  ^ the  solution  b of  (P^ ) ' 


Proof: 


The  sequence  (b**)  is  boxmded  in  !■”  and  as  such  has  a subsequence  {b  which 


converges  weakly  to  an  element  b e Clearly  b^^  .1  0 > i = 1» 


j = 1,2,... 


Define  functionals  f.  on  Z by 

j 2 


(4.13) 


f (z)  = J 


z e Z^ 


i=l 


By  taking  weak  limits  it  follows  that  f . (b)  = 5.  . Thus  b e and  {b  converges 


to  b in  the  norm  topology  of  Z 


r^^  by  (3.2).  Then,  again  by  taking  limits, 


And  J b.,=o.  ; i=l,...,m 

j=l  ^31 


As  in  §3  define 


(4.14) 


b.  ,b  = r.  . b b,  . ; 
13  11  13  il  13 


i “ l,...,ro,  3 ~ 1,2, .< 


And  this  shows  that  b is  a solution  to  (Pj)'. 


If  we  write  b. . = a. .t.s.  then,  as  before,  Xt=t  , where  X is  given  by  (1.5). 

13  13  1 3 -3  I ! 


And  if  R,C  and  A are  defined  on  by  (1.2)  zmd  (1.3),  then  the  same  proof  used  for 


Theorem  2.B  can  be  used  here  to  establish  the  following  result, 
nieorem  4.C: 


.N 


set  <, 


The  sequence  {A  a}  converges  in  to  the  solution,  b , of  (Pj) ' • 

For  the  case  m = ” the  solution  to  (P  ) ' can  be  constructed  as  follows.  For  n ^ 
. N ^ 

r / r N N 

: = 2.  ^ - / L • According  to  Theorem  4. A there  is  a unique  act  of  the 

j=l  y i=l  ^ 


form  a, . = a. .t.s.  such  that 
13  13  1 3 


(4.15) 

As  before,  define 


1 

j=l 


N 
a.  . 

13 


Vi  ' ^ ^ ” 


i a.  . -6  , j = l,2,... 

i=l  ^ ^ 


(4,16) 


i < N 


i > N 


-14- 


An  argument  similar  to  the  one  used  to  prove  Theorem  4.B  establishes  the  following 


the  sequence  {b  ) defined  by  (4.16)  converges  in  the  norm 
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